Abstract

We study the expected value of the length $L_n$ of the minimum spanning tree of the complete
graph $K_n$ when each edge $e$ is given an independent uniform $[0,1]$ edge weight. We sharpen the
result of Frieze [5] that $\lim_{n\to\infty} E(L_n) = \zeta(3)$ and show that $E(L_n) = \zeta(3) + \frac{c_1}{n} + o\left(\frac{1}{n}\right)$, where
$0.025 < c_1 < 0.040$ is an explicitly defined constant.



1 Introduction 

We study the expected value of the length $L_n$ of the minimum spanning tree of the complete graph
$K_n$ when each edge $e$ is given an independent uniform $[0,1]$ edge weight $X_e$. It was shown in Frieze
[5] that

$$\lim_{n\to\infty} E(L_n) = \zeta(3) = \sum_{k=1}^{\infty} \frac{1}{k^3} = 1.202\ldots \qquad (1)$$
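The limit in (1) can be explored empirically. The following is a minimal simulation sketch, not part of the original argument: it assumes Prim's algorithm as the way to compute the minimum spanning tree, and the function names `mst_length` and `estimate_expected_length` are hypothetical.

```python
import random

def mst_length(n, weight):
    """Length of the minimum spanning tree of K_n, computed with Prim's algorithm.

    `weight(u, v)` must return the weight of edge {u, v} for u < v.
    """
    in_tree = [False] * n
    best = [float("inf")] * n  # cheapest known edge from the tree to each vertex
    best[0] = 0.0
    total = 0.0
    for _ in range(n):
        # pick the cheapest vertex that is not yet in the tree
        u = min((v for v in range(n) if not in_tree[v]), key=lambda v: best[v])
        in_tree[u] = True
        total += best[u]
        for v in range(n):
            if not in_tree[v]:
                w = weight(min(u, v), max(u, v))
                if w < best[v]:
                    best[v] = w
    return total

def estimate_expected_length(n, trials, seed=0):
    """Monte Carlo estimate of E(L_n) with independent uniform [0,1] edge weights."""
    rng = random.Random(seed)
    acc = 0.0
    for _ in range(trials):
        w = {(u, v): rng.random() for u in range(n) for v in range(u + 1, n)}
        acc += mst_length(n, lambda u, v: w[(u, v)])
    return acc / trials
```

For moderate $n$, a call such as `estimate_expected_length(50, 200)` should land near $\zeta(3)$ up to sampling noise; the noise is typically far larger than the $1/n$ correction studied here, so the simulation illustrates (1) rather than the refinement.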

Since then there have been several generalisations and improvements. Steele [2] extended the
applicability of (1) distribution-wise. Janson [9] proved a central limit theorem for $L_n$. Penrose
[12], Frieze and McDiarmid [6], Beveridge, Frieze and McDiarmid [1], and Frieze, Ruszinkó and Thoma
[7] analysed $L_n$ for graphs other than the complete graph. Fill and Steele [3] used the Tutte
polynomial to compute $E_{\exp}(L_n)$ exactly for small values of $n$, and Gamarnik [8] computed $E_{\exp}(L_n)$
exactly for $n \le 45$ using a more efficient algorithm. The distribution of the $X_e$ in these last two
papers is exponential with mean one. Li and Zhang [11] prove that

E exp (L n ) - E(L n ) = ^ + O • (2) 

Flaxman gives an upper bound on the lower tail of $L_n$.

Equation (1) says that $E(L_n) = \zeta(3) + o(1)$ as $n \to \infty$. Ideally, one would like to have an exact
expansion for $E(L_n)$ as there is for the assignment problem, see Wästlund [15] and the references
therein. As a first step we prove

Theorem 1.

$$E(L_n) = \zeta(3) + \frac{c_1}{n} + o(n^{-1})$$

where

$$c_1 = -1 + \frac{3(\zeta(2) - \zeta(3))}{2} + \sum_{k=3}^{\infty} \frac{C(k,k)}{k^{k+1}}$$

and where

$$C(k,k) = \frac{(k-1)!}{2} \sum_{i=0}^{k-3} \frac{k^i}{i!}$$

is the number of connected graphs with vertex set $[k]$ and $k$ edges.
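The closed formula for $C(k,k)$ can be sanity-checked on small cases. The sketch below compares it with a brute-force count of connected graphs on $k$ labelled vertices with $k$ edges; the function names are hypothetical, and the brute force is only feasible for very small $k$.

```python
from itertools import combinations
from math import factorial

def unicyclic_count(k):
    """C(k, k) from the closed formula (k-1)!/2 * sum_{i=0}^{k-3} k^i / i!."""
    # (k-1)!/i! is an integer for i <= k-3, so the sum is evaluated exactly.
    total = sum(factorial(k - 1) // factorial(i) * k**i for i in range(k - 2))
    return total // 2

def brute_force_count(k):
    """Count connected graphs on vertex set {0, ..., k-1} with exactly k edges."""
    all_edges = list(combinations(range(k), 2))
    count = 0
    for edges in combinations(all_edges, k):
        parent = list(range(k))  # union-find forest

        def find(x):
            while parent[x] != x:
                parent[x] = parent[parent[x]]
                x = parent[x]
            return x

        components = k
        for u, v in edges:
            ru, rv = find(u), find(v)
            if ru != rv:
                parent[ru] = rv
                components -= 1
        if components == 1:
            count += 1
    return count
```

For $k = 3$ the only such graph is the triangle, and for $k = 4$ there are three 4-cycles and twelve triangles with a pendant edge, which the formula reproduces.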



Numerical computations show that $0.025 < c_1 < 0.040$ and these are explained in an appendix.
This shows that the rate of convergence to $\zeta(3)$ is of order $1/n$ and is from above.

2 Proof of Theorem 1

We prove the theorem by using the remarkable expression

$$E(L_n) = \int_{p=0}^{1} E(\kappa(G_{n,p}))\,dp - 1. \qquad (3)$$

This is due to Janson [9]. Here $\kappa(G_{n,p})$ is the (random) number of components in the random
graph $G_{n,p}$.
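Formula (3) can be evaluated exactly for very small $n$ by enumerating all graphs on $n$ labelled vertices. The sketch below does this with rational arithmetic; it is an illustration under the assumption that brute-force enumeration is acceptable (it is exponential in $\binom{n}{2}$), and the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial

def components(n, edges):
    """Number of connected components of the graph on {0, ..., n-1} with the given edges."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    count = n
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            count -= 1
    return count

def expected_mst_length(n):
    """Right-hand side of (3), computed exactly by summing over all graphs on n vertices."""
    all_edges = list(combinations(range(n), 2))
    N = len(all_edges)
    total = Fraction(0)
    for m in range(N + 1):
        # integral over [0,1] of p^m (1-p)^(N-m) dp = m! (N-m)! / (N+1)!
        beta = Fraction(factorial(m) * factorial(N - m), factorial(N + 1))
        for edges in combinations(all_edges, m):
            total += components(n, edges) * beta
    return total - 1
```

For $n = 2$ this gives $1/2$, the mean of a single uniform weight, and for $n = 3$ it gives $3/4$, the sum of the expected smallest and second smallest of three uniforms, in agreement with a direct calculation.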

To evaluate (3) we let $\kappa(k,j,p)$ denote the number of components of $G_{n,p}$ with $k$ vertices and $k+j$
edges. We break the evaluation of the integral in (3) into several pieces: We let

i 1/5 a 1 w ° a 1 , w ° a 31ogn 
ujq = log 1 n and p = -j- and p\ = - + — j- and p 2 = . (4) 



and 



J0 100 



^ and = log 1 / 4 n and fco = win 2 / 3 (5) 

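Lemma 2.1.

(a) $\int_{p=0}^{1} \sum_{k\ge 1} E(\kappa(k,-1,p))\,dp = \zeta(3) + \frac{3(\zeta(2)-\zeta(3))}{2n} + o(n^{-1})$.
(b) $\int_{p=0}^{1} \sum_{k\ge 3} E(\kappa(k,0,p))\,dp = \frac{1}{n}\sum_{k=3}^{\infty} \frac{C(k,k)}{k^{k+1}} + o(n^{-1})$.
(c) $\int_{p=0}^{1} \sum_{k\ge 4}\sum_{j=1}^{j_0} E(\kappa(k,j,p))\,dp = o(n^{-1})$.
(d) $\int_{p=p_0}^{p_1} \sum_{k>k_0}\sum_{j>j_0} E(\kappa(k,j,p))\,dp = o(n^{-1})$.
(e) $\int_{p=p_2}^{1} \sum_{k\ge 1}\sum_{j>j_0} E(\kappa(k,j,p))\,dp = 1 - p_2 + o(n^{-1})$.
(f) $\int_{p=0}^{p_0} \sum_{k\ge 1}\sum_{j>j_0} E(\kappa(k,j,p))\,dp = o(n^{-1})$.
(g) $\int_{p=p_0}^{p_2} \sum_{k\le k_0}\sum_{j>j_0} E(\kappa(k,j,p))\,dp = o(n^{-1})$.
(h) $\int_{p=p_1}^{p_2} \sum_{k>k_0}\sum_{j>j_0} E(\kappa(k,j,p))\,dp = p_2 - p_1 + o(n^{-1})$.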

Proof. Let $C(k,\ell)$ be the number of connected graphs on vertex set $[k]$ with $\ell$ edges. Let

$$A(k,j) = \int_{p=0}^{1} E(\kappa(k,j,p))\,dp$$

$$= \binom{n}{k} C(k,k+j) \int_{p=0}^{1} p^{k+j}(1-p)^{k(n-k)+\binom{k}{2}-k-j}\,dp$$

$$= \binom{n}{k} C(k,k+j)\, \frac{(k+j)!\,\left(k(n-k)+\binom{k}{2}-k-j\right)!}{\left(k(n-k)+\binom{k}{2}+1\right)!}$$

$$= \frac{C(k,k+j)(k+j)!}{k!} \times B(k,k+j)$$



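where

$$B(k,k+j) = \frac{n!}{(n-k)!} \cdot \frac{\left(k(n-k)+\binom{k}{2}-k-j\right)!}{\left(k(n-k)+\binom{k}{2}+1\right)!}$$

$$= \frac{1}{n^{j+1}k^{k+j+1}} \cdot \frac{\prod_{i=0}^{k-1}\left(1-\frac{i}{n}\right)}{\prod_{i=0}^{k+j}\left(1-\frac{k+1}{2n}-\frac{i-1}{kn}\right)}$$

$$= \frac{1}{n^{j+1}k^{k+j+1}} \exp\left\{\sum_{m=1}^{\infty}\frac{1}{mn^m}\left(\sum_{i=0}^{k+j}\left(\frac{k+1}{2}+\frac{i-1}{k}\right)^m - \sum_{i=0}^{k-1} i^m\right)\right\}$$

$$= \frac{1}{n^{j+1}k^{k+j+1}} \exp\left\{\sum_{m=1}^{\infty}\frac{t_m(k,j)}{mn^m}\right\}.$$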

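Observe that as $\sum_{i=1}^{k-1} i^m \ge \int_{0}^{k-1} x^m\,dx$, for $\ell = k+j$ we have

$$t_m(k,j) = \sum_{i=0}^{\ell}\left(\frac{k+1}{2}+\frac{i-1}{k}\right)^m - \sum_{i=0}^{k-1} i^m \le (\ell+1)\left(\frac{k+1}{2}+\frac{\ell-1}{k}\right)^m - \frac{(k-1)^{m+1}}{m+1}. \qquad (6)$$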
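The evaluation of $A(k,j)$ rests on the Beta integral $\int_0^1 p^a(1-p)^b\,dp = a!\,b!/(a+b+1)!$ with $a = k+j$ and $b = k(n-k)+\binom{k}{2}-k-j$. As a small check, the sketch below verifies this identity in exact arithmetic by expanding $(1-p)^b$ binomially; the function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def beta_integral_by_expansion(a, b):
    """Integrate p^a (1-p)^b over [0,1] exactly by expanding (1-p)^b binomially."""
    return sum(Fraction((-1) ** i * comb(b, i), a + i + 1) for i in range(b + 1))

def beta_integral_closed_form(a, b):
    """The closed form a! b! / (a+b+1)! used when evaluating A(k,j)."""
    return Fraction(factorial(a) * factorial(b), factorial(a + b + 1))

def exponents(n, k, j):
    """Exponents of p and (1-p) for a component with k vertices and k+j edges in G_{n,p}."""
    a = k + j
    b = k * (n - k) + comb(k, 2) - k - j
    return a, b
```

Note that $a + b + 1 = k(n-k) + \binom{k}{2} + 1$, which is the argument of the factorial in the denominator of $B(k,k+j)$.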

Case (a): $1 \le k \le n$, $j = -1$ (Tree components).
We have

$$t_1(k,-1) = \frac{3(k-1)}{2}, \qquad (7)$$

and for $m \ge 2$

Ufc ,_ 1)£ _^(!i^p_(^ + JL) m ) . ( s) 

V m + 1 \2 2k) ) <0 /c>100 



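Thus if $k \le n^{3/5}$ we have

$$B(k,k-1) = \frac{1}{k^k}\left(1 + \frac{3(k-1)}{2n} + O\left(\frac{k^3}{n^2}\right)\right)$$

giving

$$A(k,k-1) = \frac{k^{k-2}}{k} B(k,k-1) = \frac{1}{k^3} + \frac{3(k-1)}{2k^3 n} + O\left(\frac{1}{n^2}\right).$$

Thus

$$\sum_{k\le n^{3/5}} A(k,k-1) = \sum_{k\le n^{3/5}} \frac{1}{k^3} + \frac{1}{n}\sum_{k\le n^{3/5}} \frac{3(k-1)}{2k^3} + O(n^{-7/5})$$

$$= \zeta(3) + O(n^{-6/5}) + \frac{1}{n}\left(\frac{3(\zeta(2)-\zeta(3))}{2} + O(n^{-3/5})\right) + O(n^{-7/5})$$

$$= \zeta(3) + \frac{1}{n}\left(\frac{3(\zeta(2)-\zeta(3))}{2}\right) + o(n^{-1}). \qquad (9)$$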
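The two series behind (9) are easy to check numerically: $\sum_k k^{-3}$ tends to $\zeta(3)$ and $\sum_k 3(k-1)/(2k^3)$ tends to $3(\zeta(2)-\zeta(3))/2$. The following sketch compares partial sums with these limits; the cutoff and the name `tree_sums` are arbitrary choices made for illustration.

```python
import math

ZETA3 = 1.2020569031595943  # Apery's constant to double precision

def tree_sums(K):
    """Partial sums over 1 <= k <= K of 1/k^3 and of 3(k-1)/(2k^3)."""
    main = sum(1.0 / k**3 for k in range(1, K + 1))
    correction = sum(3.0 * (k - 1) / (2.0 * k**3) for k in range(1, K + 1))
    return main, correction

main, correction = tree_sums(100000)
limit = 1.5 * (math.pi**2 / 6 - ZETA3)  # 3(zeta(2) - zeta(3))/2
```

The tail of the first series beyond $K$ is of order $K^{-2}$ and that of the second is of order $K^{-1}$, which mirrors the $O(n^{-6/5})$ and $O(n^{-3/5})$ error terms obtained with $K = n^{3/5}$.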

When $k > n^{3/5}$ we have from (7), (8) that $B(k,k-1) \le k^{-k}e^{3/2}$. This implies that $A(k,k-1) \le k^{-3}e^{3/2}$. This gives

$$\sum_{k>n^{3/5}} A(k,k-1) \le \sum_{k>n^{3/5}} \frac{e^{3/2}}{k^3} = O(n^{-6/5}).$$

Together with (9), this verifies (a).

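Case (b): $1 \le k \le n$, $j = 0$ (Unicyclic components).

Rényi [13] proved that

$$C(k,k) = \frac{(k-1)!}{2}\sum_{i=0}^{k-3}\frac{k^i}{i!} \sim \sqrt{\frac{\pi}{8}}\,k^{k-1/2}.$$

This implies that for large $L$ we have

$$\sum_{k\ge L} \frac{C(k,k)}{k^{k+1}} = O(L^{-1/2}). \qquad (10)$$

Now for $k \le n^{1/3}$ we have

$$B(k,k) = \frac{1}{nk^{k+1}}\,e^{O(k/n+k^3/n^2)}$$

and

$$A(k,k) = C(k,k)B(k,k) = \frac{C(k,k)}{k^{k+1}n} + O\left(\frac{C(k,k)}{k^{k}n^2}\right) = \frac{C(k,k)}{k^{k+1}n} + O\left(\frac{1}{k^{1/2}n^2}\right).$$

This gives

$$\sum_{k\le n^{1/3}} A(k,k) = \frac{1}{n}\sum_{k\le n^{1/3}} \frac{C(k,k)}{k^{k+1}} + O(n^{-11/6})$$

$$= \frac{1}{n}\sum_{k=3}^{\infty} \frac{C(k,k)}{k^{k+1}} + O(n^{-7/6}) + O(n^{-11/6}), \qquad (11)$$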
where the $O(n^{-7/6})$ term is from (10) with $L = n^{1/3}$.

For $k > n^{1/3}$ we observe that $t_1(k,0) \le 3k/2$ in (6) and $t_m(k,0) \le 0$ for $m \ge 2$ and so

$$B(k,k) \le \frac{e^{3/2}}{nk^{k+1}}$$

and so

$$A(k,k) \le \frac{e^{3/2}C(k,k)}{k^{k+1}n} = O\left(\frac{1}{k^{3/2}n}\right).$$

It follows from this and (10) that

$$\sum_{k=n^{1/3}}^{n} A(k,k) = O(n^{-7/6}).$$

Together with (11), this verifies (b).
Case (c): 

For the cases involving complex components, we use Stirling's formula to approximate 

D(kh{ ■, (fc(n-fc) + (jD-fc-j)» < 
(fc(n-fc) + (*)+!)! 



^ fe(n - fc) + (*) + 1 J (k(n -k) + g) + " 



(l + o(l))exp 



(fc + i)(A:(n-A;) + (5)-fc-j) 



fc(n - fc) + g) + 1 J (fc(n — A) + g) + 



(1 + o(l)) exp J 1 1- < 

|^ fc(ri — k) + g) + 1 J (fc(n - fc) + g) + l) fc +^ +1 " 

(1 + o(l)) exp J 2fc2 + 2 f N 1 1- < 

p 2j+6 



(12) 



(fc(n-fc) + g) + l) fc +i+ 1 ' 
We have the general bound

$$C(k,k+j) \le \min\left\{c\,j^{-j/2}k^{k+(3j-1)/2},\ \binom{\binom{k}{2}}{k+j}\right\} \qquad (13)$$

for some absolute constant $c > 0$. The first term is from Bollobás [2] and the second counts all
graphs with $k+j$ edges.

We deduce from (fT2"j) and 

fc-i 



n k 



(n — k). 



P y n J ^{71,2?! 6n 2 } 



that for 1 < k < n/2 we have 



6fc /* ^ fc 3 I (fc+j)! . y 2 e 2jfcfc+(3j-l)/2 

#,J,San eXP \n 2^ 6^/^^ J (fc(n - (A; - l)/2)*+J+i
< ce 7 n fc exp ^ - 1 } (k + jY j 31 2 — - - --7, . 

\ \2n 6n 2 yj - (A; - l)/2) fc +J+ 1 



ce 8 f A; 3 1 / W 2 (fc + j) Y 



To obtain equation (|14j) from the previous line we use that for k < n/2, 
1 



^ 2n) 



6XP 1 2n + n 2 (m + 2)2 m + 2 n m R " 6XP \ 2n + n 2 I ^ (m + 2)2 2m + 2 



< 



k 2 k 3 / 1 1 ^ 1 \ f ^ ll^ 3 1 



(15) 



Suppose next that $1 \le j \le 100$. Then, with $\omega = \log^{1/2} n$,

k=l k=l \ J / k=n 2 / 3 u> 

Assume now that $101 \le j \le j_0$. Then for $\alpha = ce^8$, $\beta = 1/72$, $\gamma = e^3$ we have
^ a f /3fc 3 ) ^ 7 fcV 2 (fc + i) V 

L^^Pj-^jl ,1/2, I ( 16 ) 



fc=l ™ 



<_^_ / n/2 x -3/2 exp (_^ j ( x l/ 2 ( x+j ))^X 



: ji/ 2 ni 7 X=1 ^1 n 2 

n 2 / 3 
3/3V3J/2 



y = (3x 3 /n 2 ; x = n 2 / 3 y 1 / 3 / f3 1 / 3 ; dx = w ?T 2/3 dy 



-n 4 / 3 /3i/V /2 X=o 6 U 1/3 ™ 2/3 / 



: n*/ 3 /3i/ e p/ 2 J v=0 U \ ft/ 3 n 2 H 3 



( r ;%(i/2 - 1/6) + f „-^r0/6 - 1/6)) 



-O 



n 4/3pj/6jj/2 

( a/? 1 / 6 (2 7 p 
I /3i/ 2 n 4 / 3 



We use the fact that the summand in (16) is log-concave and hence unimodal in order to replace
the sum by an integral, with an extra factor two. For the final expression we bound the Gamma
integral $\Gamma(m)$ by $m^m$ for $m = j/2 - 1/6$ and $m = j/6 - 1/6$ respectively.
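The bound $\Gamma(m) \le m^m$ used for the final expression can be checked numerically on the relevant values of $m$. The sketch below works with logarithms to avoid overflow; the upper end of the range of $j$ is a hypothetical choice for illustration, since only $j \ge 101$ is fixed by the text.

```python
import math

def gamma_bound_holds(m):
    """Check log Gamma(m) <= m log m, i.e. Gamma(m) <= m^m, without overflow."""
    return math.lgamma(m) <= m * math.log(m)

def check_range(j_min, j_max):
    """Verify the bound at m = j/2 - 1/6 and m = j/6 - 1/6 for j_min <= j <= j_max."""
    return all(
        gamma_bound_holds(j / 2 - 1 / 6) and gamma_bound_holds(j / 6 - 1 / 6)
        for j in range(j_min, j_max + 1)
    )
```

The bound is far from tight: by Stirling's formula $\Gamma(m)$ is roughly $m^m e^{-m}$ up to polynomial factors, so the slack is exponential in $m$.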

It follows that we have 

^A(kJ)<-^, forj<j . (18) 



fc=l
We now consider $k > n/2$. Any component of this size will q.s.$^1$ be the unique giant component
and therefore have $k + O(n)$ edges, see [2], [10]. In this claim, we can in fact replace $k > n/2$ by
k > n 1 " ^. This is required for Case (h). 

Thus the summation limit in (18) can be replaced by $n$. This verifies (c).
Case (d): 

For $j > j_0$ we write

$$A(k,j) = A(k,j;1) + A(k,j;2) + A(k,j;3) + A(k,j;4) =$$

$$\int_{p=0}^{p_0} E(\kappa(k,j,p))\,dp + \int_{p=p_0}^{p_1} E(\kappa(k,j,p))\,dp + \int_{p=p_1}^{p_2} E(\kappa(k,j,p))\,dp + \int_{p=p_2}^{1} E(\kappa(k,j,p))\,dp,$$

where $p_0$ etc. are defined in (4), (5).



We have 

rpi 



J2J2A(k,j;2)< [ P1 -4-dp = ^ = (n- 1 ). 

1^1n t-^iv, ^ P— PO 1 



(19) 



j>jo k>k 

This verifies (d). 
Case (e): 

A standard calculation yields

$$P(G_{n,p_2} \text{ is not connected or has at most } n + j_0 \text{ edges}) = O(n^{-2})$$

and so

$$\sum_{j>j_0}\sum_{k\ge 1} A(k,j;4) = \int_{p=p_2}^{1} (1 - O(n^{-2}))\,dp = 1 - p_2 - O(n^{-2}). \qquad (20)$$

This verifies (e).
Case (f): 

We use a proof similar to [10], Theorem 5.5. Any component with $j \ge 1$ contains a subgraph of
size $\ell$ consisting of two cycles; either disjoint and joined by a path, or as a single cycle with a
chordal path. The structure is obtained by extending a path of length $\ell$ from the end points to any
two internal vertices. The expected number of such structures is at most



k>lj>jo t>4 ^ ' 



t>4 

n 2 ^ 

e>4 

= n- 4 / 3+ °( 1 ). (21) 



This verifies (f). 



$^1$ An event $\mathcal{E}$ is said to occur quite surely (q.s.) if $P(\mathcal{E}) \ge 1 - O(n^{-C})$ for any constant $C > 0$.
Case (g):

We deal next with $k \le k_0$, $j > j_0$ and $p \in [p_0, p_2]$. We have

E E E (*(^») 

k<k j>j 

<EE(lV^-^^^^ 

k<k j>j V 7 V J/ 

< V V ( ne \ r -k(k-l)/2n k+j -knp+k 2 p/2+((k+3)/2+j)p fc e 

- u ; p V2(fc+j) 



k<k j>jo 



fc<fc i>jo 



e 



_ / r) „ p 2-np+fcp/2-fc/2n \ / P Ph n \l 

<a+°(D)EE — — 2 (^r • 

k<k j>jo V / V 7 

Now n p e 2 - n P+ k o n p/2 < 2 if np > 3, in which case we have 

££eMM,p))<££(^Y< 

k<k j>jo k<k j>jo 

2E (^Y°< 24 „WW"«>. (22) 
fc<fc ^ 7 ^ 7 

Assume then that $p_0 \le p \le 3/n$. We now use the other expression for $C(k,k+j)$ in (13).

£ E E («(*.i.P)) 
k<koj>jo 



<cj^ ^ f n V fe+j (l -p) fc ( n ~ fc )+(2)- fc -^-J'/2/ ; ,fc+(3i-l)/2 
k<k j>j 

< (c + o(l)) E E ^^ fc e -fcV2n p fc+i e -fcnp+fcV2+((fc+3)/2+j)pj-i/2 fe A ; +^^^ 



< ( C + o(l)) ^ ^ (npe l- P +^/2-fc/2n )fc f Pf * 
fc<feo 3>30 



jl/2 



Assume next that np £ [3/2,3]. In this case we have n pe 1 ~ np+kp / 2 ~ k / 2n < e -np/l00_ g Q) 



,■1/2 J 

<( C + (1)) ^ ^ e -^/100^^_V =rl -naoglo g n) (23) 



E E («( fc .*p)) * ( c +°(!)) E E e ~ fcnp/10 ° ( - 1, 

k<k j>jo k<k j>jo \ 3 J 



k<k Q j>jo 

Now assume that $np = 1 + \varepsilon$ where $|\varepsilon| \le 1/2$. In this case we have

$$npe^{1-np+kp/2-k/2n} \le e^{-\varepsilon^2/2+\varepsilon^3/3+\varepsilon k/2n}.$$
So,



£ £E(K(*. J\p)) < ( C + (1)) £ J; e- 2 ^ 3 ^ 2 / 2 " fe^) 

, . . . v — "\ v — "\ f s^k k sk^ 1 ( 16 

<( C +«a))EE-p{-- + - + ^| 



k<k j>jo 

First assume that $\varepsilon \ge 2k/n$ and then we can obtain

ki<k<k j>jo k<ko V S 7 

So assume that $\varepsilon < 2k/n$. In which case (24) gives



log 1/8 n 



(24) 
(25) 



(26) 



£ E W.i.p)) * + E E^ (r^T 

c<fc j>io k<k j>j V1 ° g n/ 

< feo(c + o(l)) exp |log 3//4 n — J7(logn x loglogn)| . (27) 



This completes the proof of (g) . 
Case (h): 

We finally consider $\kappa(k,j,p)$ for $k > k_0$, $j > j_0$ and $p \in [p_1, p_2]$. Then for j >

E(4,j»)<c 



< — 



r^\j-j/2 k k+(3j-l)/2 p k+j^ 1 _ p ^{n-k)+( h 2 )-k-j 

=d t -w-^«p{^}(^y. 

Now $(A/x^{1/2})^x$ is maximized at $x = A^2/e$ giving $(A/x^{1/2})^x \le e^{A^2/2e}$ for $A, x > 0$. Also, if $u_0 = 1$
and $u_j = (A/j^{1/2})^j$ for $j \ge 1$ then

$$\frac{u_{j+1}}{u_j} = \frac{A}{e^{1/2+O(1/j)}(j+1)^{1/2}}.$$
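Both facts about $(A/x^{1/2})^x$ follow from elementary calculus: the logarithm $x\log A - \tfrac{x}{2}\log x$ has derivative $\log A - \tfrac12\log x - \tfrac12$, which vanishes at $x = A^2/e$, and $u_{j+1}/u_j = A(j/(j+1))^{j/2}(j+1)^{-1/2}$. The sketch below checks them numerically; the grid size and function names are illustrative assumptions.

```python
import math

def log_f(A, x):
    """Logarithm of (A / sqrt(x))**x for A, x > 0."""
    return x * (math.log(A) - 0.5 * math.log(x))

def check_maximum(A, grid=2000):
    """Return log f at x = A^2/e, the claimed maximum A^2/(2e), and the best value on a grid."""
    x_star = A * A / math.e
    claimed = A * A / (2 * math.e)
    at_star = log_f(A, x_star)
    # sample x on (0, 4 * x_star]
    sampled = max(log_f(A, 4 * x_star * i / grid) for i in range(1, grid + 1))
    return at_star, claimed, sampled

def ratio_factor(j):
    """(u_{j+1} / u_j) * sqrt(j + 1) / A, which does not depend on A and tends to e^(-1/2)."""
    return (j / (j + 1)) ** (j / 2)
```

The quantity returned by `ratio_factor` equals $(1+1/j)^{-j/2} = e^{-1/2+O(1/j)}$, which is the factor appearing in the displayed ratio.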

It follows that 

( pe P k 3/2\ j (p^e 2 Pk 3 \ 



j>0 

So, 



53 £E(K(*,j,p)) < ^ 10cp?fc W^)* exp | ^ 1} } • exp {^^} ■ (28) 



fc>^0 i>0 k>k
If np > 3/2 and if k\ = min ^jot/ } then,

E E «(-(*. Art) * E «W* ■ exp + + ^ 

k=k j>j k=k 



ki 

< 10cp 2 k 2 ■ exp 

k=ko 



knp "1 
200 J 



= o(n _1 ). (29) 

As explained in Case (c), any component of size at least $k_1$ will q.s. be the unique giant component.
Together with (29), this implies that

P2 kl 

£ Yl H<k,3,P))dp = P2~ 3/2n + (l/n). (30) 

P=3/2n fc=fcoi>i() 



Suppose next that $np = 1 + \varepsilon$ where we have

$$\frac{\omega_0}{n^{1/3}} \le \varepsilon \le \frac{1}{2}.$$



77 

In which case, $npe^{1-np} \le e^{-\varepsilon^2/3}$. Putting $k = \theta\varepsilon n$ where $\theta \le 1/10$ and $N = \varepsilon^3 n$ and going back to
(28) we have

en/10 nrlT n 1-\r „j , r en/10 en/10 

-e 2 fc/4 



en/10 en/10 , 2 3 . en/10 en/10 

£ £E«fc,;, P )) < £ cexp {-^ + ^ + ^} < c £ ^ /4 = c £ e ~ 

fc=fco J>0 k=ko k=ko k=ko 



This immediately implies that 

f 3/2n £n / 10 

/ ^ ^E(K(fc,j»)^ = o(n- 1 ). (32) 

J P=Pi k=k j>j 

All that remains is to show that for $p \in [p_1, p_2]$ there is a unique giant component with $k > k_0$ and
$j > j_0$, with sufficiently high probability.

It is known that with probability 1 — 0{n 2 ^ / [n{np\ — l)) 1 / 3 ) = 1 — 0(n -1 / 27 ) the random graph 
$G_{n,p}$, $p \ge p_1$, contains a unique component with $v \ge \varepsilon n/10$ vertices and at least $v + j_0$ edges. This
follows from Theorem 5.10 of [10]. This immediately implies that



r3/2n 

-pi + o(n ). 

P=P1 fc>en/10j>jo 

This does not quite finish our proof, as we also need an upper bound on the LHS of (33).
Next let

$$p_3 = \frac{1}{n} + \frac{\log n}{n^{4/3}}.$$

If $p \ge p_3$ then $\varepsilon \ge n^{-1/3}\log n$ and $\varepsilon^2 k_0 = \Omega(\log^2 n)$ and from (31) we see that

q.s. there is no component of size $k_0 \le k \le \varepsilon n/10$ in $G_{n,p}$, $p \in [p_3, 3/2n]$. (34)

For completeness we estimate for trees as well:
For completeness we estimate for trees as well: 

en/10 , . 

E E(«(ib,-i,p)) = r ^(l-pfM+lSl-w 



k=ko 



en/10 

< 2 £ ^_ (npe l-n. P+ fc P /2-fc/2n )fc 

en/10 

< 2 n c -e 2 fc/2+e 3 fc/3+efc 2 /2n 

fe=fc 
en/10 

^2„ 

= e -^(log 2 n)^ 

Thus we can work under the assumption that for $p \in [p_3, 3/2n]$ there are no components of any
type in the range $[k_0, \varepsilon n/10]$.

We can now argue that for $p \in [p_3, 3/2n]$, with the requisite probability, $G_{n,p}$ contains no more than
one component in the range $[\varepsilon n/10, n]$. For $p \in [p_3, 3/2n]$ the number of edges $m$ of $G_{n,p}$ q.s. satisfies
$m \in [\binom{n}{2}p_3 - n^{2/3}, 3n/4 + n^{2/3}]$. Let $m' = m - n^{2/3}$, and $m'' \in [m', m]$. We note that q.s. $G_{n,m''}$
has no components in the range $[k_0, \varepsilon n/10]$. It follows from (31), with $\varepsilon = (\log n)/n^{1/3}$, i.e. $p \ge p_3$,
that the probability that this fails is $e^{-\Omega(\log^2 n)}$. Also, for any event $\mathcal{E}$, $P(G_{n,m} \in \mathcal{E}) \le O(m^{1/2})P(G_{n,p} \in \mathcal{E})$
where $m = \binom{n}{2}p = O(n)$.

We add the $m - m'$ edges one at a time to create $G_{n,m}$ from $G_{n,m'}$. We cannot create new components
of size greater than $\varepsilon n/10$ without creating a component in the range $[k_0, \varepsilon n/10]$ on the way. This
is because $\varepsilon n \gg k_0$. Furthermore, if $K_1, K_2$ are components of size greater than $\varepsilon n/10$ in $G_{n,m'}$

1 — Yoo^y ) — e_log 11 • 

It follows that 

E E K^») < e-^ log2n + 1 for p G [p 3 ,3/2n] 

k>ko j>jo 

and so 

f-3/2n 

P=P3 fc>en/10i>io 

We now have to deal with the contribution from $p \in [p_1, p_3]$. It follows from [10], Theorem 5.8(i)
that in fact

k>en/10j>jo 

for all p. The random variable Z = ^(1,(2)) hi PS] Theorem 5.8(i) is the number of times in 
the whole graph process that an edge creates a complex component with $j = 1$ either by merging
two unicyclic components, or adding an additional edge within a unicyclic component. Hence $Z$
bounds the total number of complex components at any time during the process. Thus,

E(«»(G„,„)) < 

for any m > 0. Thus 

(5 + o(1))tt 



n^(G n , P )) = Y,n^(G n , m ))n\E(G n , P )\ =m)< 



as well. 
Thus, 



£ ^E(/c(fc 1 j,p))dp = of^). (36) 



k>en/10j>jo 

Equation (h) follows from (|3"0 j) . (|55|> . (|35 |> . (|55 ]> and ([56] ) . 

This completes the proof of Lemma 2.1 and the proof of Theorem 1. □



3 Final remarks

We have shown that when the $X_e$ are uniform $[0,1]$ then $E(L_n)$ converges to $\zeta(3)$ with an error
term of order $1/n$. The constant $c_1$ is positive and so for large $n$ we have $E(L_n) > \zeta(3)$. Steele [3]
computed $E(L_n)$ for $n \le 8$, and these values increase monotonically. It was natural to conjecture from this
that $E(L_n)$ increases monotonically for all $n$. This turns out not to be true.

A Estimation of $c_1$

We have by direct computation that

$$0.361 \le \sum_{k=3}^{10000} \frac{C(k,k)}{k^{k+1}} \le 0.362.$$

Furthermore,

$$\frac{C(k,k)}{k^{k+1}} = \frac{1}{2k^2}\sum_{i=3}^{k}\prod_{j=1}^{i-1}\left(1-\frac{j}{k}\right)$$

$$\le \frac{1}{2k^2}\sum_{i=3}^{k}\exp\left\{-\frac{i(i-1)}{2k}\right\}$$

$$\le \frac{1}{2k^2}\int_{x=2}^{k}\exp\left\{-\frac{x(x-1)}{2k}\right\}dx$$

$$= \frac{e^{1/8k}}{2k^{3/2}}\int_{y=\frac{3}{2k^{1/2}}}^{\frac{k-1/2}{k^{1/2}}} e^{-y^2/2}\,dy$$

after substituting $y = (x - 1/2)/k^{1/2}$

$$\le \frac{e^{1/8k}}{k^{3/2}}\sqrt{\frac{\pi}{8}}.$$

So,

$$0.361 \le \sum_{k=3}^{\infty} \frac{C(k,k)}{k^{k+1}} \le 0.362 + e^{1/80000}\sqrt{\frac{\pi}{8}}\sum_{k=10001}^{\infty}\frac{1}{k^{3/2}} \le 0.362 + 2e^{1/80000}\sqrt{\frac{\pi}{80000}} < 0.375.$$

Now $1.6449340 < \zeta(2) < 1.6449341$ and $1.2020569 < \zeta(3) < 1.2020570$ and so

$$c_1 > \frac{3(1.6449 - 1.2021)}{2} + 0.361 - 1 > 0.025,$$

$$c_1 < \frac{3(1.645 - 1.202)}{2} + 0.375 - 1 < 0.040.$$
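The estimate of $c_1$ can be reproduced along the same lines in floating point. The sketch below evaluates $C(k,k)/k^{k+1}$ through the product formula, sums it up to a cutoff $K$, and adds the tail bound $e^{1/8K}\sqrt{\pi/8}\cdot 2/\sqrt{K}$ obtained from the inequality above. The function names, the truncation tolerance and the use of double precision are assumptions made for illustration; this is not the computation used to obtain the quoted bounds.

```python
import math

def unicyclic_ratio(k, tol=1e-18):
    """C(k,k)/k^(k+1) = (1/(2k^2)) * sum_{i=3}^{k} prod_{j=1}^{i-1} (1 - j/k), in floating point."""
    total = 0.0
    prod = 1.0 - 1.0 / k           # product for i = 2
    for i in range(3, k + 1):
        prod *= 1.0 - (i - 1) / k  # now prod = prod_{j=1}^{i-1} (1 - j/k)
        total += prod
        if prod < tol:             # remaining terms are negligible in double precision
            break
    return total / (2.0 * k * k)

def partial_sum(K):
    """Sum of C(k,k)/k^(k+1) over 3 <= k <= K."""
    return sum(unicyclic_ratio(k) for k in range(3, K + 1))

def tail_bound(K):
    """Upper bound for the sum over k > K: e^(1/(8K)) * sqrt(pi/8) * 2 / sqrt(K)."""
    return math.exp(1.0 / (8 * K)) * math.sqrt(math.pi / 8) * 2.0 / math.sqrt(K)

def c1_bounds(K):
    """Lower and upper estimates for c_1 obtained by truncating the series at K."""
    zeta2 = math.pi ** 2 / 6
    zeta3 = 1.2020569031595943
    base = -1.0 + 1.5 * (zeta2 - zeta3)
    s = partial_sum(K)
    return base + s, base + s + tail_bound(K)
```

With $K = 10000$, `partial_sum(K)` can be compared with the interval $[0.361, 0.362]$ quoted above. Smaller cutoffs give a weaker lower estimate because the terms decay only like $k^{-3/2}$.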